Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Text enhancement in digital video

Identifieur interne : 002029 ( Main/Exploration ); précédent : 002028; suivant : 002030

Text enhancement in digital video

Auteurs : HUIPING LI [États-Unis] ; O. Kia [États-Unis] ; David Doermann [États-Unis]

Source :

RBID : Pascal:99-0297282

Descripteurs français

English descriptors

Abstract

One difficulty with using text from digital video for indexing and retrieval is that video images are often in low resolution and poor quality, and as a result, the text can not be recognized adequately by most commercial OCR software. Text image enhancement is necessary to achieve reasonable OCR accuracy. Our enhancement consists of two main procedures, resolution enhancement based on Shannon interpolation and text separation from complex image background. Experiments show our enhancement approach improves OCR accuracy considerably.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Text enhancement in digital video</title>
<author>
<name sortKey="Huiping Li" sort="Huiping Li" uniqKey="Huiping Li" last="Huiping Li">HUIPING LI</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Language and Media Processing Laboratory, Institute for Advanced Computer Studies, University of Maryland</s1>
<s2>College Park, MD 20742-3275</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
<settlement type="city">College Park (Maryland)</settlement>
</placeName>
<orgName type="university">Université du Maryland</orgName>
</affiliation>
</author>
<author>
<name sortKey="Kia, O" sort="Kia, O" uniqKey="Kia O" first="O." last="Kia">O. Kia</name>
<affiliation wicri:level="2">
<inist:fA14 i1="02">
<s1>Mathematical and Computational Sciences Division, National Institute of Standards and Technology</s1>
<s2>Gaithersburg, MD 20899</s2>
<s3>USA</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Doermann, D" sort="Doermann, D" uniqKey="Doermann D" first="D." last="Doermann">David Doermann</name>
<affiliation wicri:level="2">
<inist:fA14 i1="02">
<s1>Mathematical and Computational Sciences Division, National Institute of Standards and Technology</s1>
<s2>Gaithersburg, MD 20899</s2>
<s3>USA</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
<placeName>
<settlement type="city">College Park (Maryland)</settlement>
<region type="state">Maryland</region>
</placeName>
<orgName type="university" n="3">Université du Maryland</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">99-0297282</idno>
<date when="1999">1999</date>
<idno type="stanalyst">PASCAL 99-0297282 INIST</idno>
<idno type="RBID">Pascal:99-0297282</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000823</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000B71</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000758</idno>
<idno type="wicri:doubleKey">1017-2653:1999:Huiping Li:text:enhancement:in</idno>
<idno type="wicri:Area/Main/Merge">002139</idno>
<idno type="wicri:Area/Main/Curation">002029</idno>
<idno type="wicri:Area/Main/Exploration">002029</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Text enhancement in digital video</title>
<author>
<name sortKey="Huiping Li" sort="Huiping Li" uniqKey="Huiping Li" last="Huiping Li">HUIPING LI</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Language and Media Processing Laboratory, Institute for Advanced Computer Studies, University of Maryland</s1>
<s2>College Park, MD 20742-3275</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
<settlement type="city">College Park (Maryland)</settlement>
</placeName>
<orgName type="university">Université du Maryland</orgName>
</affiliation>
</author>
<author>
<name sortKey="Kia, O" sort="Kia, O" uniqKey="Kia O" first="O." last="Kia">O. Kia</name>
<affiliation wicri:level="2">
<inist:fA14 i1="02">
<s1>Mathematical and Computational Sciences Division, National Institute of Standards and Technology</s1>
<s2>Gaithersburg, MD 20899</s2>
<s3>USA</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Doermann, D" sort="Doermann, D" uniqKey="Doermann D" first="D." last="Doermann">David Doermann</name>
<affiliation wicri:level="2">
<inist:fA14 i1="02">
<s1>Mathematical and Computational Sciences Division, National Institute of Standards and Technology</s1>
<s2>Gaithersburg, MD 20899</s2>
<s3>USA</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
<placeName>
<settlement type="city">College Park (Maryland)</settlement>
<region type="state">Maryland</region>
</placeName>
<orgName type="university" n="3">Université du Maryland</orgName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">SPIE proceedings series</title>
<idno type="ISSN">1017-2653</idno>
<imprint>
<date when="1999">1999</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">SPIE proceedings series</title>
<idno type="ISSN">1017-2653</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Document analysis</term>
<term>Document image processing</term>
<term>Document retrieval</term>
<term>Image restoration</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Traitement image document</term>
<term>Restauration image</term>
<term>Reconnaissance forme</term>
<term>Reconnaissance optique caractère</term>
<term>Recherche documentaire</term>
<term>Analyse documentaire</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Recherche documentaire</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">One difficulty with using text from digital video for indexing and retrieval is that video images are often in low resolution and poor quality, and as a result, the text can not be recognized adequately by most commercial OCR software. Text image enhancement is necessary to achieve reasonable OCR accuracy. Our enhancement consists of two main procedures, resolution enhancement based on Shannon interpolation and text separation from complex image background. Experiments show our enhancement approach improves OCR accuracy considerably.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>Maryland</li>
</region>
<settlement>
<li>College Park (Maryland)</li>
</settlement>
<orgName>
<li>Université du Maryland</li>
</orgName>
</list>
<tree>
<country name="États-Unis">
<region name="Maryland">
<name sortKey="Huiping Li" sort="Huiping Li" uniqKey="Huiping Li" last="Huiping Li">HUIPING LI</name>
</region>
<name sortKey="Doermann, D" sort="Doermann, D" uniqKey="Doermann D" first="D." last="Doermann">David Doermann</name>
<name sortKey="Kia, O" sort="Kia, O" uniqKey="Kia O" first="O." last="Kia">O. Kia</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002029 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002029 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:99-0297282
   |texte=   Text enhancement in digital video
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024